##        X          fixed.acidity   volatile.acidity  citric.acid
##  Min.   :   1.0   Min.   : 4.60   Min.   :0.1200   Min.   :0.000
##  1st Qu.: 400.5   1st Qu.: 7.10   1st Qu.:0.3900   1st Qu.:0.090
##  Median : 800.0   Median : 7.90   Median :0.5200   Median :0.260
##  Mean   : 800.0   Mean   : 8.32   Mean   :0.5278   Mean   :0.271
##  3rd Qu.:1199.5   3rd Qu.: 9.20   3rd Qu.:0.6400   3rd Qu.:0.420
##  Max.   :1599.0   Max.   :15.90   Max.   :1.5800   Max.   :1.000
##  residual.sugar     chlorides       free.sulfur.dioxide
##  Min.   : 0.900   Min.   :0.01200   Min.   : 1.00
##  1st Qu.: 1.900   1st Qu.:0.07000   1st Qu.: 7.00
##  Median : 2.200   Median :0.07900   Median :14.00
##  Mean   : 2.539   Mean   :0.08747   Mean   :15.87
##  3rd Qu.: 2.600   3rd Qu.:0.09000   3rd Qu.:21.00
##  Max.   :15.500   Max.   :0.61100   Max.   :72.00
##  total.sulfur.dioxide    density             pH          sulphates
##  Min.   :  6.00       Min.   :0.9901   Min.   :2.740   Min.   :0.3300
##  1st Qu.: 22.00       1st Qu.:0.9956   1st Qu.:3.210   1st Qu.:0.5500
##  Median : 38.00       Median :0.9968   Median :3.310   Median :0.6200
##  Mean   : 46.47       Mean   :0.9967   Mean   :3.311   Mean   :0.6581
##  3rd Qu.: 62.00       3rd Qu.:0.9978   3rd Qu.:3.400   3rd Qu.:0.7300
##  Max.   :289.00       Max.   :1.0037   Max.   :4.010   Max.   :2.0000
##     alcohol         quality
##  Min.   : 8.40   Min.   :3.000
##  1st Qu.: 9.50   1st Qu.:5.000
##  Median :10.20   Median :6.000
##  Mean   :10.42   Mean   :5.636
##  3rd Qu.:11.10   3rd Qu.:6.000
##  Max.   :14.90   Max.   :8.000
## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# Quality Histogram

Most of the wines have a quality score of 5 or 6.
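A quick tabulation can back up what the quality histogram shows. The following is a minimal base R sketch, not the code used in this analysis: `count_by_quality` is a hypothetical helper name, and `toy_quality` holds made-up values rather than the real data. The levels 3 to 8 are taken from the minimum and maximum of `quality` in the summary output.

```r
# Hypothetical helper: count wines per quality score.
# Levels 3 to 8 match the min and max of `quality` in the summary output.
count_by_quality <- function(quality) {
  table(factor(quality, levels = 3:8))
}

# Toy values for illustration only (not taken from the dataset).
toy_quality <- c(5, 6, 5, 6, 7, 5, 4, 6, 5, 8)
counts <- count_by_quality(toy_quality)
print(counts)

# Share of wines rated 5 or 6 in the toy vector.
share_5_6 <- sum(counts[c("5", "6")]) / sum(counts)
print(share_5_6)
```

With the real data, the same helper could be called as `count_by_quality(wineData$quality)`.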
## Scale for 'x' is already present. Adding another scale for 'x', which
## will replace the existing scale.
The sulphates histogram has outliers: a few wines have much higher sulphate values than the rest.
Residual sugar for most wines lies between about 1.5 and 2.5, which shows that most wines are low in sugar.
## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
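The acidity histogram is slightly right-skewed: the bulk of the wines sits at the low end with a tail toward high values, which shows that most wines have relatively low acidity.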
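One way to check the direction of skew numerically is to compare the mean with the median. In the summary output, `fixed.acidity` has a mean of 8.32 above a median of 7.90, which is consistent with a longer tail toward high values. The sketch below is a rough heuristic only; `skew_direction` is a hypothetical helper name and `toy_acidity` contains made-up values.

```r
# Heuristic skew check: for a unimodal distribution, mean > median
# usually signals a longer right tail (right skew).
skew_direction <- function(x) {
  m <- mean(x, na.rm = TRUE)
  med <- median(x, na.rm = TRUE)
  if (m > med) "right" else if (m < med) "left" else "symmetric"
}

# Toy values for illustration only (not taken from the dataset).
toy_acidity <- c(7.1, 7.4, 7.8, 7.9, 8.3, 9.2, 12.5, 15.9)
direction <- skew_direction(toy_acidity)
print(direction)
```

With the real data, the call would be `skew_direction(wineData$fixed.acidity)`.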
## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
Most of the wines are low in sugar.

# Bivariate analysis
In the line plot above, density fluctuates up and down constantly, so no clear pattern can be read from it.
The noisy density plot becomes readable once the data are grouped by quality. The grouped plot also shows that high-quality wines have higher density values.
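Grouping by quality, as described above, can be sketched in base R with `aggregate()`. This is an illustration under stated assumptions, not the code behind the plot: `mean_by_quality` is a hypothetical helper, and the toy data frame contains made-up values that are not meant to show any real trend.

```r
# Hypothetical helper: mean of one column per quality score.
mean_by_quality <- function(df, column) {
  out <- aggregate(df[[column]], by = list(quality = df$quality), FUN = mean)
  names(out)[2] <- paste0("mean_", column)
  out
}

# Toy data frame for illustration only (not taken from the dataset).
toy <- data.frame(
  quality = c(5, 5, 6, 6, 7, 7),
  density = c(0.9970, 0.9966, 0.9965, 0.9969, 0.9962, 0.9972)
)

grouped <- mean_by_quality(toy, "density")
print(grouped)
```

With the real data, `mean_by_quality(wineData, "density")` would return one row per quality score, which is much easier to plot and read than the raw series.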
## `geom_smooth()` using method = 'gam'
The scatter plot above shows that acidity rises with citric acid content. The outlier was removed to make the data easier to read.
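The text does not say how the outlier was removed. One common approach, shown here purely as an assumption, is to drop rows above an upper quantile of the variable. `trim_upper` is a hypothetical helper, the cutoff probability is an arbitrary choice, and the toy data frame contains made-up values.

```r
# Hypothetical helper: keep rows whose value in `column` is at or below
# the chosen upper quantile. Rows with NA in `column` are dropped.
trim_upper <- function(df, column, prob = 0.99) {
  cutoff <- quantile(df[[column]], probs = prob, na.rm = TRUE)
  keep <- !is.na(df[[column]]) & df[[column]] <= cutoff
  df[keep, , drop = FALSE]
}

# Toy data frame for illustration only (not taken from the dataset).
toy <- data.frame(
  citric.acid   = c(0.00, 0.10, 0.25, 0.30, 0.45, 1.00),
  fixed.acidity = c(7.1, 7.4, 7.9, 8.3, 9.2, 15.9)
)

# A low cutoff is used here only because the toy sample is tiny.
trimmed <- trim_upper(toy, "citric.acid", prob = 0.9)
print(trimmed)
```

With the real data, something like `trim_upper(wineData, "citric.acid")` would drop only the top 1% of citric acid values before plotting.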
## `geom_smooth()` using method = 'gam'
The scatter plot above shows that wines with lower pH have higher acidity.
The bar plot shows that high-quality wines have relatively higher alcohol content than lower-quality wines.
The plot above shows that low-quality wines have a stronger vinegar taste (higher volatile acidity), but between quality 7 and 8 there is no significant change.
Most quality levels show the same level of saltiness; quality 3 is the exception, because its box (interquartile range) is much larger.
Here we can see that more total sulfur dioxide goes along with more free sulfur dioxide. In this dataset, however, the points are clustered at the low end, which shows that most wines are low in sulfur.
## `geom_smooth()` using method = 'gam'
There is a negative correlation between density and alcohol: the higher the density, the lower the alcohol content.
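The strength and sign of this relationship can be quantified with a Pearson correlation coefficient via `cor()`. The sketch below uses a made-up toy data frame, constructed only to mimic the direction described above; it is not the real data and the resulting value says nothing about the actual dataset.

```r
# Toy data frame for illustration only (not taken from the dataset).
toy <- data.frame(
  density = c(0.9990, 0.9980, 0.9975, 0.9968, 0.9960, 0.9950),
  alcohol = c(9.0, 9.4, 9.8, 10.2, 11.0, 12.0)
)

# Pearson correlation: values near -1 mean that higher density
# goes with lower alcohol content.
r <- cor(toy$density, toy$alcohol, method = "pearson")
print(r)
```

With the real data, the call would be `cor(wineData$density, wineData$alcohol)`; a negative result would support the reading of the scatter plot.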
Lower free sulfur dioxide goes along with lower total sulfur dioxide, and the points are clustered at the bottom, which shows that most wines are low in sulfur.

# Multivariate analysis
The multivariate plot above shows that quality 5 wines have higher acidity than quality 7 wines.
We can see a correlation between acidity and pH. I think citric acid is the reason for the acidity, because wines with less citric acid have lower acidity and a higher pH.
The grid plot above shows that quality 5 and 6 wines have more sulfur. It also shows that they have less citric acid, which leads to lower acidity.
## `stat_bindot()` using `bins = 30`. Pick better value with `binwidth`.
Wines of quality 5 and 6 have high acidity and a wide range of density, whereas qualities 3, 4, 7, and 8 have lower acidity; quality 8 has both low acidity and low density.
The plot above shows that wines with lower density have higher alcohol content; quality 7 wines in particular have lower density and higher alcohol content.
library(ggplot2)
ggplot(wineData, aes(x = total.sulfur.dioxide, y = free.sulfur.dioxide)) +
  geom_point() +
  facet_grid(~ quality)
The streak of points is shorter for lower-quality wines, which shows that lower-quality wines contain less sulfur.

# Final Plot

## 1) Count of Quality of Wine
## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# Description

The plot shows that we do not have sufficient data for every quality level: only qualities 5 and 6 are present in high volume, so drawing conclusions will be difficult.

## 2) Quality on acidity vs pH
# Description

We can see that high-quality wines have lower acidity, but their pH values are widely spread. This made me look at the relation between pH and volatile acidity: volatile acidity does not seem to affect pH much.

## 3) Citric acid level on acidity vs pH
# Description

The plot shows that a high citric acid level goes with high acidity and a low pH; likewise, a low citric acid level goes with low acidity and a higher pH.
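Comparing "levels" of citric acid implies binning a continuous variable into categories. How the levels were defined for this plot is not stated, so the sketch below is an assumption: it uses `cut()` with arbitrary break points chosen inside the 0 to 1 range that the summary output reports for `citric.acid`. `citric_level` is a hypothetical helper and `toy_citric` holds made-up values.

```r
# Hypothetical helper: bin citric acid into three levels.
# The break points are arbitrary assumptions, not the author's choice.
citric_level <- function(citric, breaks = c(0, 0.25, 0.5, 1)) {
  cut(citric,
      breaks = breaks,
      labels = c("low", "medium", "high"),
      include.lowest = TRUE)
}

# Toy values for illustration only (not taken from the dataset).
toy_citric <- c(0.00, 0.10, 0.25, 0.26, 0.50, 0.75, 1.00)
levels_out <- citric_level(toy_citric)
print(table(levels_out))
```

With the real data, `wineData$citric.level <- citric_level(wineData$citric.acid)` would give a factor that can be mapped to colour in a scatter plot of acidity against pH.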